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SUMMARY 


This paper reviews the enhancement to NASTRAN program performed by 
NUK (Nippon Univac Kaisha , Ltd.) added to Level 15.5. Features discussed include 
intermediate checkpoint— restart in triangular decomposition, 1/0 improvement, 
multibanked memory and new plate element. The first three improvements provides 
the capability to solve significantly large size problem, while the new elements 
release the analyst from the cumbersome work to constrain the singularities 
caused by the lack of stiffness of inplane rotation of old plate elements. 


INTRODUCTION 


Since 1974, NUK has been maintaining and developing a NASTRAN based on 
COSMIC level 15.5 and distributing it to the UNIVAC customer in Japan and Europe 
as SPERRY/NASTRAN in co-operation with Sperry Support Services. In Japan, more 
than 35 customer are using our version of NASTRAN. 

Since the leasing policy and export restrictions of source code were placed 
on post level 15.5, our motivation to offer SPERRY/NASTRAN is to provide state- 
of-the-art analytical capabilities incorporated in level 16.0 or later and error 
corrections in a timely manner to the UNIVAC customers outside U. S. A. In 
addition to maintaining a SPERRY/NASTRAN, we are also publishing a Newsletter 
each time the new version in released, providing user training seminar 8 times 
a year and developing and maintaining pre and post processors. 

These improved features are listed in Table 1 and some of them are discussed 
m this paper • 


DC0MP2 


In static analysis, the computing time associated with triangular decomposition 
forms 70-80% of total time if the number of grid points exceeds 3000. Since 
a checkpoint can be taken only after the completion of the module, if 
the decomposition is interrapted by unexpected error such as max time, dish fault, 
system hung up, the computation up to this point of error is completly lost. 

In this case, restart run can save only 10-15% of total run time. The use of new 
module DC0MP2 provides a method by which any number of intermediate checkpoints 
can by taken during symmetric matrix decomposition. The module supervises 
the existing symmetric decomposition routine (SDCOMP) by interrupting 
the factoring process at intervals specified by the user. 
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The new module, DC0MP2 , must be applied to an analysis through a rigid 
format DMAP alter. (Fig 1) 

DMAP Calling Sequence 

DC0MP2 KLL / LLL, ULL, SCR / V, N, NPARA / V, N, MPARA $ 

Input Data Block 

KLL - Partition of stiffness matrix - 1 set 

Note : KLL is assumed to be symmetric and may not be purged. 


Output Data Blocks 


LLL - Lower triangular factor of KLL - 1 set. 

ULL - Upper triangular factor of KLL - 1 set. 

SCR - Scratch Data Block for checkpoint/restart, contains the front 

matrix currently held in open core. 


Notes 


1. LLL and ULL may not be purged. 

2. ULL is not a standard upper triangular matrix. Its format is 
compatible only for input to subroutine FBS. 


Parameters 

NPARA - Input/output - integer - no default 
= -1 No operation is taken 
= 0 Normal decomposition 

= +N Number of intermediate checkpoints required. 

MPARA - output - integer - no default, MPARA contains the number of 
checkpoints completed. 


Notes : 1. The number of checkpoints required, NPARA, is decreased by 1 

each time an intermediate checkpoint is made. 

2. The number of checkpoints made, MPARA is increased by 1 
each time an intermediate checkpoint is made. 

3. Both parameters are saved in data block XVPS and used at the 
time of restart to determine current location in the 
decomposition procedure. 

Method 


The general procedure for utilizing the intermediate checkpoint feature, 
is to replace the existing symmetric decomposition instruction, e g. RBMG2 with 
the new module DC0MP2 . This is accomplished by using a set of DMAP alters for 
rigid format. 
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The new module contains a parameter, MPARA, which is used to specify 
the number of intermediate checkpoints to be taken. The user must determine 
before the first execution, the value of NPARA. If a restart is required of 
a partially decomposed matrix, the value of NPARA supplied in the DMAP instruction 
is replaced by the value in the XVPS data block, which can be used to determine 
at which row in the KLL the last checkpoint occured. 

It may be noted that the checkpoint dictionary provided after each 
intermediate checkpoint, specifies a re-entry into DMAP instruction 90, however, 
the use of DMAP alter forces re-execution of the inserted module. This action 
assured that the restart will re-enter the DMAP instruction 89. When the value 
of NPARA is reduced to 0 a normal exit from the SDCOMP routine will be made and 
NPARA set to -1. If a restart is made after the decomposition has been 
completed, NPARA = -1, the DCOMP2 module gives an immediate return, therefore, 
it is not necessary to remove the DMAP alter statements. 

Fig. 1 describes the meaning of the parameter NPARA which determines how 
the matrix is broken into an rows for decomposition. At the first entry to 
subroutine SDCOMP, an is decided from NPARA and the matrix size N. When 
the decomposition is completed for every AN rows, SDCOMP copies the values in open 
core and scratch file (in case spill occures) to SCR, then gives an alternate 
return to DC0MP2 . DCQMP2 calls XCHK to write LLL, ULL and SCR on NPTP and after 
returning from XCHK, calls SDCOMP again to continue the decomposition and 
increase the value of MPARA by 1. 

In the restart run, SDCOMP recovers open core from SCR and resumes 
decomposition from MPARA *an + 1 st row of matrix KLL. 


IMPROVED I/O 


SPERRY/NASTRAN has two improvements in I/O routine, asynchronous I/O and 
multiple I/O block read. 

Asynchronous I/O 

In conventional NASTRAN, one buffer area is assigned to each I/O unit and 
actual I/O request will be initiated when the buffer area is filled in case of 
write operation. The execution of NASTRAN is suspended until all data in buffer 
area is transfered to external storage. 

In most of computer systems, asynchronous I/O capability is provided which 
makes it possible to process I/O operation and non-I/O operation simultaneously. 
To take full advantage of this capability, I/O routines of NASTRAN (GINO) were 
drastically rewritten. 

The new GINO routine divides the I/O buffer into two parts. One of them is 
used for the data transfer for the higher level subprograms and the other is used 
to transfer the data from/to the external storage. (Fig 2-1) 

This new capability has no effect to computer time, but the elapsed time 
(wall clock time) may be reduced to 2/3 of the old. 


3 



MREAD (Multiple Read Routine) 


In real eigenvalue analysis , Invers Power Method is most papulary used due 
to its efficiency. Iterative procedure of this method is described as follows. 


step 

1 

2 

3 

4 

5 

6 
7 


A 0 — X { 

K- X iM ->D 
D -*L-U 
M-U a ^V a+1 
(L*U)' l V rl+1 -LWi 
Wa+i/c “* 'Un+i 


description 

shift 

form dynamic matrix 
triangular decomposition 
matrix multiplication 
substitution 
normalization 


convergence check 


where 


K : stiffness matrix 

M : mass matrix 

X : estimated eigenvalue 

u, v, w : iteration vector 


Generally, for the extraction of one eigenvalue, step 4, 5 will be performed 
8 ~ 10 times and step 3 will be done once. Regarding to computer time and memory 
size, step 3 and step 4, 5 have quite different characteristics described below. 


step 

CPU time 

I/O time 

memory size 

3 

0 (NB 2 ) 

0 (NB) 

B 2 

4, 5 

0 (NB) 

0 (NB) 

14 X N 


where N : size of matrix 

B : semi-band width of matrix 

As easily shown from this table, CPU time is dominant in step 3 while 
I/O time is dominant in step 4, 5, and the memory size in step 4, 5 is much less 
than that of step 3. 

To reduce the I/O time in step 4, 5, we can remember the fact that I/O time 
depends on not only the number of words transfered but also the number of times 
I/O operations requested. 
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The new open routine (MOPEN) determines the number of I/O blocks held in 
open core. The lowest I/O routine (GINOIO) was changed so that the one I/O 
operation fills all of these I/O blocks and the I/O request of one block to 
GINOIO is considered to be a change of pointer to current I/O block untill all 
the I/O blocks held in open core were exhausted. 

Since the size of I/O block was not changed, the size of actual I/O request 
can be determined according to the size of open core available. 

This new capability is also applied to FBS (forward backward substitution) , 
TRD (transient response displacement) and TRHT (transient response heat transfer) . 
The numerical examples described in Fig 2.2 thru Fig 2.5 shows us that the I/O 
time will be reduced to 1/3 to 1/5 of the old. 


MULTI BANKED MEMORY 


At this time Univac system has no virtual storage capability, and 
the addressing limit of 262 KWD. Therefore, if the summation of B (semi-band width) 
and C (number of active columns) of stiffness matrix exceeds nealy 470, front 
matrix generated during the symmetric real decomposition (performed by SDCOMP) 
can not be held in the main memory. Some portion of this overflowed area 
(called spill') is processed in the area for active column, but remainder are 
stored in disk area. The processing of this spilled portion requires frequent 
I/O operation which results significant decrease of the execution efficiency. 

On the other hand, the model size which is indicated by the number of grid 
point and number of elements has been increasing significantly due to the require- 
ment of the engineers to obtain more precise simulation result. With the aid of 
powerful, easy to use preprocessors, the size of finite element model can easily 
exceeds the above limit. 


To override this defect, SPERRY/NASTRAN provides automatic memory expansion 
capability to use banked memory as internal file up to 4MWD. The spilled portion 
of the front matrix is not stored in disk space but in this banked memory 
(called ADDITIONAL CORE) . Since the data transfer to (or from) this banked 
memory is no longer an I/O operation but a simple store (or load) operation, 
the I/O time was significantly reduced. (Fig 3.1, 3.2) 

For the implementation of this capability, following subroutines are 
developed. 

OPENX - According to the number of words requested, reserve the banked 
memory via MCORES and external disk space if necessary. 

READX/WRITEX - Transfer the data from/to the banked memory. 

CLOSEX - Release the banked memory via LCORES. 

To examine the efficiency of this new capability, comparisons between 
the new and old NASTRAN are done. Because the comparison with large size problem 
which causes spill from 262 KWD requires much CPU time, middle size problems are 
examined restricting program size less than 200 KWD to cause spill processing. 
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From the results shown in Fig 3.3 and Fig 3.4, we can conclude that 
the significant reduction in I/O time is achieved while the CPU time reduction 
is a little. Since the number of words of spilled area is proportional to 
the square of B+C, the effect of this capability becomes more apparent as 
the problem size increases. 


NEW PLATE ELEMENTS 


NASTRAN assumes 6 degrees of freedom (d.o.f.) per grid point. However, all 
the original elements have 5 or less d.o.f. s at their grid point. Though the lack 
of stiffness of each grid point is checked and informed by NASTRAN automatically, 
the generation of appropriate constraint data for these singularities is 
a difficult work especially in the following case. 

inclined flat plate model 

curved shell which has a big radius of curvature 

Some modules are developed to constrain these singularities by assuming that 
all the d.o.f. which remains singular should be constrained by SPC processing. 

Some other modules only made the card images to constrain such d.o.f. s by SPC or 
MPC processing. Considering the difficulties of such function due to 
the existence of MPC, we thought it better to incorporate the plate element which 
has 6 d.o.f.s at their grid points. 

Two plate elements named TRIA3 and QUAD4 are incorporated. These elements 
are formulated from 'Pian's hybrid element*. Assuming the displacement function 
on each side of the element, the stress function within the element, 
the equilibrium equation on grid point is formulated based on the principle of 
complemental virtual work. Membrane element and bending element are produced 
independently and these are combined without coupling each other. Following are 
the outline of triangular element. 

Membrane Element 


The degrees of freedom on a grid point are u, v and 9z. U and v are 
the components of displacements paralle to the axis of the local coordinate 
system. 0z is the average value of the rotation of side, not the ratational angle 
of grid point itself. The displacement functions are defined on the side and 
shown as follows. 

displacement tangential to side (Us) = linear function 
displacement normal to side (Un) = cubic function 
rotational angle (© = £Un/6r) = quadratic function 

where 

r is parameter on side 
n is normal direction to side 
s is tangential direction to side 
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The stress function is defined within element and as follows. 
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where ( C 1. C 2. C 3 ) are the area coordinates. 

In order to ^afisfy the equilibrium equation within element, 2 components 
of 9 {<7)2. {0)3 are dependent. 

Bending Element 

The degrees of freedom on the grid point are W, ©x and Gy. w is 
the deflection normal to the x, y plane. 

The displacement functions are defined on the side and shown as follows. 

deflection W = cubic function 

side direction slope GW/Gs = quadratic function 
normal slope to side G W/Gn = linear function 

The stress function is defined within element as follows. 


{*) 




C 3 f { * 3 1 1 

C 3 { { O )2 ^ 

c 3 J i< ^ } 3 j 


f m x 

{ <J } i = | l«y y 

[ n»xy j at grid point i 


where nix = JV x *2 d z 

the integration takes place over direction to plate thickness. 


9 components of stress are independent. 

Composite Shell Element 

Shell element is composed of membrane element and bending element without 
coupling each other. 
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Quadrilateral Element 


Quadrilateral element (QUAD 4 ) is composed from the 4 overlapping TRIA3 
elements. (Fig 4.1) 

Numerical Evaluation 

The model and loading condition are shown in Fig 4.2 and the results are 
shown in Fig 4.3 thru Fig 4.5. From these results, following conclusions are 
obtained. 

(1) Inplane bending 

Because the elements TRIA2 and QUAD 2 absorbed the inplane bending as 
the form of shear, the deformed shape of such elements was not so good. In 
this viewpoint, TRIA3 and QUAD 4 are recognized to be improved, and moreover 
the stress calculated falls in the safety side. 

(2) Out of plane bending 

Regarding the displacement, no visible difference between TRIA3 and 
TRIA2 is recognized. But the distribution of reaction forces reverses. 

(3) Inplane tension 

The TRIA2 and QUAD 2 elements shows good results, on the other hand, 
the solution of new elements at the end side causes disorder. 

(4) Summary 

The accuracy of new elements TRIA3 and QUAD4 is better than TRIA2 and 
QUAD 2 in many cases. Moreover, introduction of ©z as additional d.o.f. makes 
lose the necessities of constraint by SPC or MPC card, troubles on modeling 
will be reduced. 


CONCLUSIONS 


Several improvements and enhancements performed by NUK (Nippon Univac Kaisha, 
Ltd.) are described in the paper. These modifications increase the usefulness 
and efficiency of the NASTRAN program. 

Due to the leasing policy and export restrictions placed on post 15.5, and 
the requirement of Japanese customer such as, timely error corrections, response 
to technical questions which sometimes need the understanding of source code and 
Japanese manual, we will continue to develop our NASTRAN. But if NASA can consider 
alternatives to the current policies so that the Japanese company can respond 
above requirements, the number of users of COSMIC/NASTRAN will increase steadily. 
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The list of current development priorities for our NASTRAN are : 

• FEER method 

• HEXA element - 8 to 20 nodes (variable) 

• PLOAD3 , PLOAD4 - pressure load on solid surface 

• BAR element including warping 

• Improved solid element - incompatible mode 

• Response spectrum 

• Output of strain and strain energy 
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Table 1. Major Improvements Added to Level 15.5 



Item 

Description 

1) 

Isoparametric 

Elements 

IS2D4, IS2D8, IS3D8, IS3D20 

2) 

RF14 

Thermal transient-structural static 

3) 

RF13 

Normal mode analysis with differential stiffness 

4) 

ELBOW 

curved beam element 

5) 

PLOADl 

distributed load on BAR element 

6) 

TRIA3, QUAD 4 

Plate element with 6 d.o.f.s at each grid point 

7) 

REAR, RBE2 

Rigid elements 

8) 

AUTOSPC 

Automatic constraint by SPC 

9) 

READ 

Append capability 

10) 

RF4 

Iterative procedure 

11) 

NOLIN 

Iterative procedure 

12) 

GPSC 

SPC , MPC generator 

13) 

DCOMP2 

intermediate checkpoint-restart in SDCOMP 

14) 

GINO 

asynchronous I/O, multiple block read 

15) 

SDCOMP 

banked memory 

16) 

FES , INVFBS 

non-transmit type UNPACK 

17) 

TRIA6 , QUAD 8 

higher order shell element 

18) 

PENTA 

15 node wedge element 

19) 

SPRNG 

Spring element 


Remark ) Items 1) thru 4) are performed by Sperry Support Services. 
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Following DMAP ALTERS are required for Rigid Format I. 
ALTER 3 

FILE LLL=APPEND/ULL=APPEND/SCR=APPEND $ 

ALTER 89,89 

DC0MP2 KLL/LLL,ULL,SCR/V,N,NPARA=2/V,N,MPARA $ 
ENDALTER 


Fig. I Intermediate Checkpoint-restart in SDCOMP 
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Fig. 3.1 Spill logic in SDCOMP 
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Fig. 3.2 Rank structure 




(a) Case 



Case A 



(b) Case B : old : 200KWD 

new : 200KWD + ADDITIONAL CORE 43KWD 


to 

o 


Time 



1 SUP in NASTRAN 


SUP in SDCOMP 


Case B 


No . of 
GRID points 

1061 

No • of 
Elements 

900 

B + C 

450 

N 

3966 


SUP in NASTRAN 



old 

new 

TOTAL 

1.0 

0.541 

SUP 

CPU 

0.264 

0.260 


SUP in SDCOMP 



old 

new 

TOTAL 

0.615 

0. 136 

SUP 

CPU 

0. 122 

0. 1 17 


Fig. 3.4 Efficiency test - 2 
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Fig. 4.2 Test Model for TRIA3 and QUAD4 element 
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Fig. 4.5 Test Results of New Element - Inplane Tension 
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